Goto

Collaborating Authors

 Orange


Becoming a Centenarian

The New Yorker

Like The New Yorker, I was born in 1925. Somewhat to my surprise, I decided to keep a journal of my hundredth year. The author, who was born on December 17, 1925, notes that the magazine's first issue came out ten months before he did. Old age is no joke, but it can feel like one. You look everywhere for your glasses, until your wife points out that you're wearing them. I turn a hundred this year. People act as though this is an achievement, and I suppose it is, sort of. Nobody in my family has lived this long, and I've been lucky. I'm still in pretty good health, no wasting diseases or Alzheimer's, and friends and strangers comment on how young I look, which cues me to cite the three ages of man: Youth, Maturity, and You Look Great. On the other hand, I've lost so many useful abilities that my wife, Dodie, and I have taken to calling me Feebleman. Look, up in the sky! No, it's Dodie doesn't want me to know how old she is, but she's nearly three decades younger than I am, and I become ...


Linguistic Knowledge Transfer Learning for Speech Enhancement

arXiv.org Artificial Intelligence

Linguistic knowledge plays a crucial role in spoken language comprehension. It provides essential semantic and syntactic context for speech perception in noisy environments. However, most speech enhancement (SE) methods predominantly rely on acoustic features to learn the mapping relationship between noisy and clean speech, with limited exploration of linguistic integration. While text-informed SE approaches have been investigated, they often require explicit speech-text alignment or externally provided textual data, constraining their practicality in real-world scenarios. Additionally, using text as input poses challenges in aligning linguistic and acoustic representations due to their inherent differences. In this study, we propose the Cross-Modality Knowledge Transfer (CMKT) learning framework, which leverages pre-trained large language models (LLMs) to infuse linguistic knowledge into SE models without requiring text input or LLMs during inference. Furthermore, we introduce a misalignment strategy to improve knowledge transfer. This strategy applies controlled temporal shifts, encouraging the model to learn more robust representations. Experimental evaluations demonstrate that CMKT consistently outperforms baseline models across various SE architectures and LLM embeddings, highlighting its adaptability to different configurations. Additionally, results on Mandarin and English datasets confirm its effectiveness across diverse linguistic conditions, further validating its robustness. Moreover, CMKT remains effective even in scenarios without textual data, underscoring its practicality for real-world applications. By bridging the gap between linguistic and acoustic modalities, CMKT offers a scalable and innovative solution for integrating linguistic knowledge into SE models, leading to substantial improvements in both intelligibility and enhancement performance.


Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations

arXiv.org Artificial Intelligence

Retrieval-augmented generation (RAG) has emerged as a critical mechanism in contemporary NLP to support Large Language Models(LLMs) in systematically accessing richer factual context. However, the integration of RAG mechanisms brings its inherent challenges, as LLMs need to deal with potentially noisy contexts. Recent studies have shown that LLMs still struggle to critically analyse RAG-based in-context information, a limitation that may lead to incorrect inferences and hallucinations. In this paper, we investigate how to elicit critical reasoning in RAG via contrastive explanations. In particular, we propose Contrastive-RAG (C-RAG), a framework that (i) retrieves relevant documents given a query, (ii) selects and exemplifies relevant passages, and (iii) generates explanations that explicitly contrast the relevance of the passages to (iv) support the final answer. We show the impact of C-RAG building contrastive reasoning demonstrations from LLMs to instruct smaller models for retrieval-augmented tasks. Extensive experiments demonstrate that C-RAG improves state-of-the-art RAG models while (a) requiring significantly fewer prompts and demonstrations and (b) being robust to perturbations in the retrieved documents.


Improving Retrieval Augmented Language Model with Self-Reasoning

arXiv.org Artificial Intelligence

The Retrieval-Augmented Language Model (RALM) has shown remarkable performance on knowledge-intensive tasks by incorporating external knowledge during inference, which mitigates the factual hallucinations inherited in large language models (LLMs). Despite these advancements, challenges persist in the implementation of RALMs, particularly concerning their reliability and traceability. To be specific, the irrelevant document retrieval may result in unhelpful response generation or even deteriorate the performance of LLMs, while the lack of proper citations in generated outputs complicates efforts to verify the trustworthiness of the models. To this end, we propose a novel self-reasoning framework aimed at improving the reliability and traceability of RALMs, whose core idea is to leverage reasoning trajectories generated by the LLM itself. The framework involves constructing self-reason trajectories with three processes: a relevance-aware process, an evidence-aware selective process, and a trajectory analysis process. We have evaluated our framework across four public datasets (two short-form QA datasets, one long-form QA dataset, and one fact verification dataset) to demonstrate the superiority of our method, which can outperform existing state-of-art models and can achieve comparable performance with GPT-4, while only using 2,000 training samples.


Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness

arXiv.org Artificial Intelligence

Are the longstanding robustness issues in NLP resolved by today's larger and more performant models? To address this question, we conduct a thorough investigation using 19 models of different sizes spanning different architectural choices and pretraining objectives. We conduct evaluations using (a) OOD and challenge test sets, (b) CheckLists, (c) contrast sets, and (d) adversarial inputs. Our analysis reveals that not all OOD tests provide further insight into robustness. Evaluating with CheckLists and contrast sets shows significant gaps in model performance; merely scaling models does not make them sufficiently robust. Finally, we point out that current approaches for adversarial evaluations of models are themselves problematic: they can be easily thwarted, and in their current forms, do not represent a sufficiently deep probe of model robustness. We conclude that not only is the question of robustness in NLP as yet unresolved, but even some of the approaches to measure robustness need to be reassessed.


Conversation Style Transfer using Few-Shot Learning

arXiv.org Artificial Intelligence

Conventional text style transfer approaches focus on sentence-level style transfer without considering contextual information, and the style is described with attributes (e.g., formality). When applying style transfer in conversations such as task-oriented dialogues, existing approaches suffer from these limitations as context can play an important role and the style attributes are often difficult to define in conversations. In this paper, we introduce conversation style transfer as a few-shot learning problem, where the model learns to perform style transfer by observing only a few example dialogues in the target style. We propose a novel in-context learning approach to solve the task with style-free dialogues as a pivot. Human evaluation shows that by incorporating multi-turn context, the model is able to match the target style while having better appropriateness and semantic correctness compared to utterance/sentence-level style transfer. Additionally, we show that conversation style transfer can also benefit downstream tasks. For example, in multi-domain intent classification tasks, the F1 scores improve after transferring the style of training data to match the style of the test data.


Pandemic Control, Game Theory and Machine Learning

arXiv.org Artificial Intelligence

Game theory has been an effective tool in the control of disease spread and in suggesting optimal policies at both individual and area levels. In this AMS Notices article, we focus on the decision-making development for the intervention of COVID-19, aiming to provide mathematical models and efficient machine learning methods, and justifications for related policies that have been implemented in the past and explain how the authorities' decisions affect their neighboring regions from a game theory viewpoint.


NEWS: ROSS Intelligence Partners with Sedgwick LLP โ€“ ROSS' #LegalTech Corner

#artificialintelligence

ROSS Intelligence and Sedgwick LLP are pleased to announce their partnership. Sedgwick will bringing the power of ROSS Intelligence's advanced artificial intelligence (AI) to its Creditors' Rights and Bankruptcy group. "We are excited to be on the cutting edge of new technology to provide superior and more efficient service to our clients," said Mike Healy, Sedgwick Chair of the Firm. "Our firm's Creditors' Rights and Bankruptcy Practice is already using AI to enhance our legal research, making it faster to analyze complex legal issues and to discover relevant'connect-the-dots' information that otherwise might go unnoticed. This technology will allow us to best represent, protect and advance the interests of our clients."


Paralyzed woman controls robot arm with mind

AITopics Original Links

Using only her thoughts, a Massachusetts woman paralyzed for 15 years directed a robotic arm to pick up a bottle of coffee and bring it to her lips, researchers report in the latest advance in harnessing brain waves to help disabled people. In the past year, similar stories have included a quadriplegic man in Pennsylvania who made a robotic arm give a high-five and stroke his girlfriend's hand, and a partially paralyzed man who remotely controlled a small robot that scooted around in a Swiss lab. But will the experimental brain-controlled technology ever help paralyzed people in everyday life? Experts in the technology and in rehabilitation medicine say they are optimistic that it will, once technology improves and the cost comes down. The latest report, which was published online Wednesday in the journal Nature, comes from scientists at Brown University, the Providence VA Medical Center in Rhode Island, Harvard Medical School and elsewhere.


Home Page of the Loebner Prize

#artificialintelligence

What is the Loebner Prize? The Loebner Prize for artificial intelligence ( AI) is the first formal instantiation of a Turing Test. The test is named after Alan Turing the brilliant British mathematician. Among his many accomplishments was basic research in computing science. In 1950, in the article Computing Machinery and Intelligence which appeared in the philosophy journal Mind, Alan Turing asked the question "Can a Machine Think?"